Identifying Key Phoneme Features
نویسندگان
چکیده
Spectrograms carry all necessary information for reliable human and computer perception of speech. This paper discusses the importance of spectrogram features used by a recognition algorithm developed by Ali et al. as they relate to human perception. Features, including MNSS, burst frequency, formant transitions, voicing onset time, and voicing/unvoicing information are defined and their importance to computer stop consonant recognition described. Confirming many previous findings, burst frequency and formant transitions were found to be most important in the perception of speech synthesized from spectrograms while other features played a secondary role. Software tools developed that should facilitate other similar investigations are described.
منابع مشابه
Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملبهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگیهای استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز
The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...
متن کاملLanguage identification and accent variation detection in spoken language recordings
We develop a model for identifying languages and accents in audio recordings. Our Hierarchical-Sequential Nodule Model (HSNM) incorporates both short-distance features (which capture simple linguistic distinctions, e.g. phoneme inventories) and longdistance features (which detect long-distance suprasegmental patterns, e.g. tone and prosody) which help a classifier discriminate intelligently amo...
متن کاملطراحی الگوریتم بازشناسی واجها با به کارگیری همبسته های آکوستیکی مشخصه های واجی
In the present paper, the phonological feature geometry of the Persian phonemes is analyzed in the form of articulate-free and articulate-bound features based on the articulator model of the nonlinear phonology. Then, the reference phonetic pattern of each feature that consists of one or a set of acoustic correlates, characterized by the quantitative or qualitative values in its phonological re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999